Unsupervised Ontology Population Using Latent Semantic Analysis

نویسندگان

  • Theerayut Thongkrau
  • Pattarachai Lalitrojwong
چکیده

A large ontology such as lexical ontology is useful as the basic knowledge base in artificial intelligence and computational linguistics application. However, it is insufficient to recognize only existing instances for each concept. Adding new instances into the lexical ontology will expand knowledge in the system. In this paper, we propose an efficient unsupervised ontology population system that classifies new instances into a corresponding lexical ontology concept. Compared to previous related works, it does not require manual preprocessing to prepare training data. In terms of processing time, it does not need to search for many concepts in the lexical ontology. Our system employs latent semantic analysis together with context voting to find the appropriate concept of the instance. In sum, the system achieves higher accuracy when the lexical ontology contains a lot of concepts, which generally occurs in practical problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

Combining Rule-Based Methods and Latent Semantic Analysis for Ontology Structure Construction

We report on a pilot study of techniques for organizing terms derived from relevant corpora into semantic clusters. We combine rule-based techniques and an unsupervised statistical method to semantically cluster terms extracted from medical reports. The rule-based approach identifies lexicosyntactic patterns suggestive of parent-child relations and coordination structures suggestive of sibling-...

متن کامل

Unsupervised structured semantic inference for spoken dialog reservation tasks

This work proposes a generative model to infer latent semantic structures on top of manual speech transcriptions in a spoken dialog reservation task. The proposed model is akin to a standard semantic role labeling system, except that it is unsupervised, it does not rely on any syntactic information and it exploits concepts derived from a domain-specific ontology. The semantic structure is obtai...

متن کامل

Terminological ontology learning and population using latent Dirichlet allocation

The success of Semantic Web will heavily rely on the availability of formal ontologies to structure machine understanding data. However, there is still a lack of general methodologies for ontology automatic learning and population, i.e. the generation of domain ontologies from various kinds of resources by applying natural language processing and machine learning techniques In this paper, the a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010